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Embedding data in material and removing the embedded data 



The present invention relates to embedding data in material and removing the 
embedded data. Embodiments of the invention relate to watermarking of material. 

In this application, material means any one or more of video material, audio 
material and data material. Video is -generic to still and moving images. 

Steganography is the embedding of data into material such as video material, 
audio material and data material in such a way that the data is imperceptible in the 
material. 

Watermarks are data embedded in material such as video material, audio 



A watermark may be used for various purposes. It is known to use watermarks 
for the purpose of protecting the material against, or trace, infringement of the 
intellectual property rights of the owner(s) of the material. For example a watermark 
may identify the owner of the material. 

Watermarks may be "robust" in that they are difficult to remove from the 
material. Robust watermarks are useful to trace the provenance of material which is 
processed in some way either in an attempt to remove the mark or to effect legitimate 
processing such as video editing or compression for storage and/or transmission. 
Watermarks may be "fragile" in that they are easily damaged by processing which is 
useful to detect attempts to remove the mark or process the material. 

Visible watermarks are useful to allow e.g. a customer to view an image e,g. 
over the Internet to determine whether they wish to buy it but without allowing the 
customer access to the unmarked image they would buy. The watermark degrades the 
image and the mark is preferably not removable by the customer. Visible watermarks 
are also used to determine the provenance of the material into which they are 
embedded. 

When material has been watermarked it is desirable to be able to remove the 
mark. However, embedding a watermark in the material in such a way as to make 
difficult unauthorised removal may also have the consequence that the watermark is 
difficult to remove by an authorised person. 




ite^rmark-may^be-impercepti bl e or ] 




material. 
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According to a first aspect of the invention there is provided: - 
a) A method of removing data embedded in material comprising the steps of: 
receiving material in which data is embedded; 

accessing an information store storing information enabling the data to be 
5 removed; and 

removing the said data using the enabling data accessed from the store. 



b)A method comprising the steps of: 

embedding data in material; and 
10 storing in an information store information for enabling the data to be removed 

from the material. 

c) Apparatus for removing data embedded in material comprising: 
an input for receiving material in which data is embedded; 
1 5 an information store for storing information enabling the data to be removed; 

and 

a remover arranged to remove the said data using the enabling data 
accessed from the store. 

20 d) Apparatus comprising: 

an embedder for embedding data in material; 

a store for storing information for enabling the data to be removed from the 
material; and 

a generator for generating the enabling information when the said data is 
25 embedded in the material. 

The provision of the stored enabling data allows access to enabling data which 
allows the embedded data to be removed. In preferred embodiments of this aspect of 
the invention, the enabling data is stored in a manner which is secure against 
unauthorised access to it. Methods of preventing unauthorised access to secure data 
30 are well known. 
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It has also been found that, in for example video material, embedding 
watermarks in identical fashion in different images results in differing degrees of 
difficulty in removing the watermarks. 

According to a second aspect of the present invention, there is provided a 
method-comprising-the-steps-of: _ _ 

embedding data in first material to produce second material in which data is 
embedded; 

removing the data from the second material to produce recovered material; 

comparing the first and recovered material to determine the differences and 
4ocations-o£differences4her-ebetween;-^and 

storing corrections which correct the said differences and data identifying the 
said locations at which the differences occur in the first material. 

The second aspect also provides apparatus for embedding data in material 
comprising: 

an embedder for embedding data in first material to produce second material in 
which data is embedded; 

a remover for removing the data from the second material to produce recovered 
material; 

a comparator for comparing the first and recovered material to determine the 
differences and locations of differences therebetween; and 

a store for storing data identifying the said locations and corrections which 
correet the said differences. 

According to a third aspect of the present invention there is provided a method 
of removing data embedded in material, the data being embedded in the material 
according to the embedding method of said second aspect, the removing method 
comprising the steps of: 

removing the data from the second material to produce recovered material; 

deriving the said corrections and locations from a store storing data identifying 
the said locations and corrections which correct the said differences; and 

using the corrections to correct the recovered material at the said locations. 
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The third aspect also provides apparatus for removing data from material in 
which the data has been embedded by the apparatus of said second aspect, the 
removing apparatus comprising: 

a remover for removing the data from the second material to produce recovered 
5 material; 

a deriver for deriving the said corrections and locations from the said store; and 

a corrector arranged to use the stored corrections to correct the recovered 
material at the said locations. 

The invention also provides a system comprising a combination of the 
10 apparatus of the said second aspect (herein after a data embedder) and the apparatus of 
said third aspect (herein after a data remover. 

Ideally, the aspects of the invention operate optimally with a lossless channel 
between the embedding apparatus and the removing apparatus. If the channel between 
the embedding apparatus and the removing apparatus is known, an emulator emulating 
1 5 that channel may be provided in the embedding apparatus between the embedded and 
remover of the embedding apparatus. 

For a better understanding of the present invention, reference will now be made 
by way of example to the accompanying drawings in which: 

Figure 1 is a schematic block diagram of an illustrative watermark embedding 
20 and removal system in accordance with the present invention; 

Figure 2 is a more detailed schematic block diagram of an embedder of the 
system of Figure 1; 

Figures 3 A and B illustrate an example of a window of coefficients and how 
the window relates to a coefficient Ci being modified to embed a bit of a watermark; 
25 Figure 4 is a flow diagram of a method of calculating strength a in accordance 

with an example of the invention; 

Figure 5 is a schematic block diagram of a watermark decoder; 
Figure 6 is a schematic block diagram of a watermark remover; 
Figure 7 is a flow diagram of a method of calculating strength a in accordance 
30 with an example of the invention; 

Figure 8 is a schematic diagram of an alternative, illustrative, set of coefficients 
usable to calculate a; 
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Figures 9 and 10 are diagrams used herein below to describe wavelets 
transforms; and 

Figures 1 1 and 12 are diagrams of data structures of UMIDs. 
Overview 

5 Figure 1 illustrates a system, in this example a watermarking system, for 

embedding a watermark onto an image I and for recovering and removing it from the 
image. The watermarking system 10 comprises: a source 110 of the image I;, a sub- 
system A for applying the watermark; a sub-system B for detecting and removing the 
watermark; a 'lossless' channel C linking the sub-systems A and B; and a database D. 

10 In overview, the subsystem A applies the watermark to the image. The 

watermarked image is transmitted via the lossless channel C to the subsystem B where 
the watermark is detected and removed. It is desired to restore the original image in the 
subsystem B with no degradation. For that purpose, at the subsystem A the watermark 
is removed from the image by a remover 130 and the thus restored image is compared 

1 5 with the original image I to detect differences and the locations within the image of the 
differences. The database D stores the locations of differences and correction data 
which may be values of the original image at those locations or the differences. The 
subsystem B detects and removes the watermark to produce a substantially restored 
image. The removal process is the same as at the subsystem A. The locations of 

20 corrections and the corrections are read from the database and the corrections applied 
to the restored image to correct it. In practice the system will operate on many images. 
Thus it is necessary to identify the images and the correction data associated therewith 
in the database D. In this example each image is identified by an identifier which is 
used to access the relevant data in the database. Preferably the identifier is a UMID. 

25 UMIDs are described in the section UMIDs below. Most preferably the watermark 
comprises the UMID. In the example of Figure 1, the image is transformed by a 
wavelet transform. Wavelet transforms are discussed in the section Wavelets below. 
In the example of Figure 1 wavelet coefficients are compared to determine the said 
differences. 

30 First Example 

Referring to Figure 1, the source 1 10 produces a spatial domain image I. An 
embedder 120 receives a UMID from a generator 115 and embeds the UMID as the 
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) watermark in wavelet coefficients Ci of a wavelet transform (T) of the image 

according to 

Ci' = Ci + a.Ri 

Where Ci is the I th original image coefficient, Ci' is the I th modified coefficient, Ri is 1 th 
5 bit of watermark data and a is a scaling factor. As described hereinafter Ri may be a 
bit of a pseudo random symbol sequence modulated by the UMID data Wi. 

The remover 130 removes the watermark to produce restored coefficients Ci" 
according to 

Ci" = Ci' - <x.Ri 

10 A comparator 125 compares the restored coefficients Ci" with the original 

coefficients Ci to determine any differences and the locations thereof. The differences 
and locations i are stored in the database D together with the UMID generated by 
generator 115. 

The coefficients Ci' produced by the embedder 120 are inverse transformed 
1 5 (T -1 ) and applied to the channel C. 

The subsystem B receives the watermarked image from the channel C. A 
transformer T recreates the coefficients Ci'. The UMID is detected and removed by a 
detector and remover 127. The detected UMID is used to access the correction data in 
the data base and the correction data is applied to the image coefficients Ci" in a 
20 corrector 129 to restore the original coefficients Ci which may be inverse transformed 
( T l )to restore the original image I. 

. ^ In this first example, a is a fixed predetermined value. Using a fixed 
predetermined value of a facilitates removal of the watermark. 
Second Example 

25 The second example is identical to the first except that a channel emulator 121 

is provided between the embedder 120 and remover 130. The emulator applies, to the 
output of the embedder, a channel emulator function emulating the effect of channel C 
on the output of the embedder. 

The channel emulation 121 emulates the channel C. Thus errors introduced by 
30 the channel C can be detected and corrections stored in the database D. 
This is useful especially if the channel C is lossy. 
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) Third Example 

This modifies the first or second example in that a is not fixed. The embedder 
120 is described with reference to Figures 2 to 4 and the remover 130, 127 is described 
with reference to Figures 5 to 7. 
5 Figure 2 illustrates the watermark embedder 120 in more detail. The 

watermark embedder 120 comprises pseudo-random sequence generator 220, an error 
correction coding generator 200, a wavelet transformer 210, an inverse wavelet 
transformer 250, a first combiner 230, a data converter 225 and a second combiner 
240. The wavelet transformer 210 includes a frame store FS1. The inverse 
10 transformer 250 includes a frame store FS2. The frame store FS1 stores a frame of 
unmodified coefficients Ci. The frame store FS2 stores a frame of modified 
coefficients Ci'. 

The error correction coding generator 200 receives the UMID and outputs an 
error correction coded UMID to the first combiner 230. The pseudo-random sequence 
1 5 generator 220 outputs a pseudo-random binary sequence (PRBS) Pi, where i is the i* 
bit of the sequence, to the first combiner 230. The PRBS has a length L x J of bits 
where J is the number of bits in the error correction encoded UMID. Each bit j of the 
error correction encoded UMID then modulates a section of length L of the PRBS. The 
first combiner 230 logically combines the error correction encoded UMID with the 
20 PRBS to produce a watermark having bits Ri. A bit Wj=0 of the error correction 
encoded UMID inverts L bits of the PRBS. A bit Wj=l of the error correction 
encoded UMID does not invert the PRBS. Thus bits Wj of the error correction encoded 
UMID are spread over L bits of the PRBS. The data converter 225 converts binary 1 to 
symbol +1 and binary 0 to symbol -1 to ensure that binary 0 bits contribute to a 
25 correlation value used in the decoder of Figure 5. 

The wavelet transformer 210 receives the video image I from the source 110 
and outputs wavelet coefficients to the second combiner 240. Wavelets are briefly 
discussed in the section Wavelets below. 

The second combiner 240 receives the watermark Ri, the wavelet coefficients 
30 Ci and watermark strength cri and outputs modified coefficients Ci' where 

Ci'= Ci + cri Ri 
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The inverse wavelet transformer 250 receives the modified coefficients Ci' and 
outputs a spatial domain watermarked image F. 

The embedder includes an ECC generator 200. The use of error correction 
coding to produce an error correction coded UMID is advantageous since it allows the 
5 UMID 175 to be reconstructed more readily should some information be lost. This 
provides a degree of robustness to "future processing or attacks "against" the watermark. 
The use of a pseudo-random sequence Pi to generate a spread spectrum signal for use 
as a watermark is advantageous since it allows the error correction coded UMID 205 
to be spread across a large number of bits. Also, it allows the watermark to be more 

10 effectively hidden and reduces the visibility of the watermark. Applying the 
watermark to a wavelet transform of the image is advantageous since this reduces the 
perceptibility of the watermark. Furthermore, the strength of the watermark is 
adjusted by ai to control the visibility of the watermark. 

The operation of the error correction code generator 200 will now be described. 

1 5 The error correction code generator 200 receives a UMID. Typically the UMID will 
be a binary sequence of 31 bytes. The error correction code generator 200 typically 
outputs a 511 bit error correction coded binary sequenfce. Various error correction 
coding schemes are known. One approach uses BCH coding which corrects up to 31 
bit errors. The error correction rates can be further improved by using knowledge of 

20 the UMID format to help correct errors. One such approach is to check for invalid 
dates times GPS locations etc. 

The watermark is preferably embedded in predetermined regions of the wavelet 
transformed image. Most preferably the upper horizontal (hHJV)and upper vertical 
(lH,hV) bands are used. These bands are chosen as watermarks embedded in these 

25 regions are not readily perceptible. The length of the pseudo-random sequence may be 
chosen such that the watermark fills the predetermined regions in each wavelet image. 
Calculating a. Figures 3 and 4. 

In accordance with an illustrative embodiment of the invention, for each 
coefficient Ci, a value of a, ai is calculated, ai is calculated as 

30 ai = F {Cn}i, 
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where {Cn}i is a set of unmodified wavelet coefficients excluding Ci, which set may 
vary with i, that is respective values of ai are functions F of respective sets {Cn}j. This 
is shown as step S8 in Figure 4. 

The coefficients Qii of the set may be in the same wavelet band as Ci or may 
be in different bands from Ci and from each other as described below with reference to 
Figure 8. 

If the coefficients are in the same band as Ci, they are preferably in a window 
adjacent Ci. For example the set comprises N coefficients Ci-1 to Ci-N as shown in 
Figure 3 and the embodiment will be described in the following with reference to that. 

The number N of coefficients may vary with Ci; thus for generality N is 
denoted as Ni. 

The function F may be any suitable function. In this illustrative embodiment F 
is such that 

ai = F{Cn}/ = -^.^C„ 2 for n = i-l to i-N for Ni*0 and ai = k for Ni = 0. 

1 5 Figure 3 A is a map of wavelet coefficients in a frame store 300, the coefficients 

being in level 1 of a wavelet transform. In a preferred embodiment, the coefficients Ci 
are modified only in the upper horizontal hH, IV and upper vertical 1H, hV bands to 
embed the watermark. However, coefficients in other bands and/or in other levels may 
be modified to embed a watermark. In the following only band hH, IV is considered. 

20 The wavelet coefficients are stored in the frame store 300 (also denoted FS1 in 

Figure 2) and in this example are stored as shown in Figure 3A grouped in the bands. 
The coefficients are serially ordered. For example they may be serially ordered by a 
raster scan thereof Other scanning patterns are known. Assuming serial ordering of 
the coefficients in each band, for each coefficient Ci to be modified, there is defined a 

25 set {Cn}i (herein also referred to as a 'window') of Ni coefficients excluding Ci. The 
set {Cn}i consists of the Ni coefficients Ci-1 to Ci-Ni preceding coefficient Ci on the 
same line , up to a maximum of for example M most recent coefficients. It will be 
noted that in the band hH, IV coefficient CI has no preceding coefficients, C2 has only 
one preceding coefficient, and so on. For coefficient CI ai is set to a predetermined 

30 value K. For subsequent coefficients the set comprises the totality of preceding 
coefficients. 
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Thus cci is defined individually for each coefficient Ci to be modified. In the 
example above it is defined by the set of Ni unmodified coefficients preceding Ci. By 
choice of the appropriate function F, cri is adapted to the image such that image 
degradation can be minimised. In addition as will be discussed below in the section 
Remover, this allows cri to be recalculated from the watermarked image coefficients, 
after those have been restored to their original values. This improves the accuracy of 
restoring the original image. 

Referring to Figure 4 the illustrative procedure for calculating Ci' is as 
follows:- 

The calculation procedure starts at step S2. At step S4, i is initiates with value 0. At 
step S6, i is incremented by 1 to calculate al at step S8 for coefficient CI'. At step 
S10 the value of modified coefficient CT is calculated. The procedure then reverts to 
step S6 and i is incremented. The procedure continues until all coefficients have been 
modified. 

15 In addition, the calculation of a may be modified in one or both of the 

following ways:- 

1) If ai<aTL 5 it is incremented to cctl, where cctl is a lower threshold; and 
if ai> a™ it is reduced to a™, where octh is an upper threshold. 

2) The magnitude | C„ I of each coefficient is compared with a threshold Cth- 
20 If I C n | >Cth then Cn is not included in the calculation of ai,; or 

if I C n I >C T h C n is clipped to (C n / |C n |)C T H 

Watermark decoder and remover. Figures 5 and 6. 
Decoder Figure 5 

25 The operation of the watermark decoder 140 will now be explained in more 

detail with reference to Figure 5. The watermark decoder 140 receives the 
watermarked image I' and outputs the restored UMID. The watermark decoder 140 
comprises a wavelet transformer 310, a reference pseudo-random sequence (PRBS) 
generator 320, a correlator 330, a selector 340 and a error correction coding decoder 

30 350. The PRBS generated by the generator 320 is identical to that generated by the 
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PRBS generator 220 of Figure 2 and converted by a data converter (not shown ) to 
values +1 and -1 as described above. 

The wavelet transformer 310 receives the watermarked image F and, in known 
manner, outputs the modified wavelet coefficients Ci\ The correlator 330 receives the 
5 reference pseudo-random sequence PRBS having symbols Pi of values +1 and -1 from 
the pseudo-random sequence generator 320, and the wavelet coefficients Ci r and 
outputs a watermark image bit correlation sequence 335. The watermarked image bit 
correlation sequence is determined in the following way. 

The modified wavelet coefficients Ci'=CH-ajRj where Ri are bits of PRBS 
10 modulated by error-correction encoded bits Wj of UMID. In the example given above 
there are 51 1 bits Wj. Each bit Wj modulates L bits of PRBS. There are JL bits in the 
modulated PRBS. 

For each error correction encoded bit Wj, the correlater 330 calculates a 
correlation value 

jL+L 

15 S\= ^Cf.Pi 

where j = 0, 1, 2... T-l, and T is the number of error correction encoded bits. In 
this example T=5 1 1 . A sequence 335 of correlation values S) is produced. 

The correlation sequence 335 is received by the selector 340 which outputs an 
uncorrected UMID 345. The selector 340 outputs a bit value "1" for a value of S' 
20 greater than 0 and a bit value "0" for S' less than or equal to 0. The error correction 
code decoder 350 receives the uncorrected UMID 345 and in known manner outputs 
the restored UMID 145. 

The reference PRBS Pi is synchronised with the modulated PRBS in the 
watermarked image. For that purpose a synchroniser (not shown) is used. Such 
25 synchronisation is known in the art. 
Remover Figure 6. 

The watermark remover 130 receives the restored UMID 145, and the 
watermarked image I' and outputs a restored image I". The watermark remover 130 
comprises a pseudo-random sequence generator 420 for generating a reference pseudo- 
30 random sequence Pi identical to that produced by generators 220 and 320, a spread 
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spectrum signal generator 430 which produces, via a data converter 425, a restored 
watermark Ri' having bit values +1 and -1 from the restored UMID 145 and the 
pseudo-random sequence Pi. The reference sequence Pi is synchronised with the 
modulated sequence in the watermarked image in known manner. 

The watermark remover 130 further comprises a wavelet transformer 410 
which produces modified wavelet coefficients Ci' from the watermarked image F, a 
strength estimator 460 for calculating a and a combiner 440 which calculates restored 
wavelet coefficient values according to the equation 

Ci = Ci' - a;. Ri'. 

The restored wavelet coefficients C are fed to an inverse wavelet transformer 
450 which outputs the restored image I". 

Calculating ai. Figure 7. 

In accordance with the illustrative embodiment of the invention, cci is 
15 calculated in the embedder as described above in the section Calculating a The 
estimator 460 of the remover of Figure 6 recalculates a in analogous manner from 
coefficients Ci which have been restored to their original values. 

Thus referring for example to Figure 3A and to Figures 6 and 7, the modified 
coefficients Ci' are stored in a frame store 300 indicated as FS 3 in the wavelet 
20 transformer of Figure 6 in the same way as shown in figure 3A and they are serially 
ordered in the same way as described with reference to Figure 3 A. It will be recalled 
that coefficient Ci' has no preceding coefficients so al = k and CI = CI'- kRl. For 
each subsequent coefficient Ci, ai is calculable from the set Ni of preceding restored 
coefficients, all of which have been restored to their original value according to 
25 Ci = Ci' - ai. Ri\ 

Referring to Figure 7, the calculation procedure starts at step S5. At step S7, i 
is initiated to 0. At step S9, i is incremented by 1 to calculate al at step Sll for 
coefficient CI'. At step S13 the original value CI is calculated from coefficients CI'. 
The procedure then reverts to step S9 and i is incremented. The procedure continues 
30 until all coefficients Ci' have been restored to their original values Ci. 
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) As in the embedder of Figure 2, the calculation of a may be modified in one or 

both of the following ways:- 

1) If ai<ari, it is incremented to cctl, where cctl is a lower threshold; and 
if cti> ccth it is reduced to a™, where a™ is an upper threshold. 
5 2) The magnitude | Cn | of each coefficient is compared with a threshold Cth- 
If | Cn | >Cth then Cn is not included in the calculation of ai; or 
if I Cn | >C TH C n is clipped to (Cn/|Cn|)C T H- 



JLQ Modifications. 

As mentioned above the coefficients from which the value of ai is 
calculated may be in different bands to the related coefficient Ci which is to be 
modified or restored to its original value. Thus by way of example, referring to Figure 
8, the set of coefficients {Cn}i used to calculate ai of band hH, IV may be in the other 

15 bands. In the example of Figure 8 the set {Cn}i is shown as including coefficients Cli, 
C2i and C3i which are at positions related to the position of coefficient Ci. In this 
way, image properties in other bands are taken into account in calculating ai to ensure 
that the watermark is imperceptible. 

The coefficients Cli, C2i and C3i used to modify or restore Ci, may be 

20 coefficient which are never modified. That can be done by modifying only 
coefficients in one or more bands such as hH, IV and leaving the coefficients in other 
bands- unmodified. Alternatively at least some of the coefficients Cli, C2i and C3i 
used to modify or restore Ci may be modified. That can be done by storing the 
coefficients in a frame store 300 as shown in Figure 3 or 8 and by reading out 

25 coefficients in an order which allows the procedures of Figures 4 and 7 to be followed. 

It will be appreciated that whilst the foregoing discussion refers for ease of 
explanation to only 3 coefficients Cli, C2i and C3i in 3 bands in one level, in practice 
many more coefficients may be used and the coefficients may be in more than three 
bands and in more than one level. 

30 Other transforms 
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Whilst the invention has been described by way of example with reference to 
Wavelet transforms, it may be used with other transforms for example DCT. 
Other material 

Whilst the invention has been described by way of example with reference to 
material comprising video material ( still or moving images), it may be applied to other 
material, for example audio material and data material. 
PRBS 

As described hereinabove, the PRBS has a length of L J where J is the number 
of bits in a UMID. Thus each bit Wj of the UMID modulates a section of length L of 
the PRBS. Instead, it may have a length of L bits and be repeated for each bit j of the 
UMID. ~~ " ~~~~~ ~~ " 

Other Watermark data . 

Whilst the invention has been described by way of example with reference to 
1 5 UMIDs as the watermark data, it may be used with other data as the watermark. 
Using modified coefficients to calculate a 

The foregoing embodiment calculates a using unmodified coefficients. In 
alternative embodiments a is calculated using modified coefficients or a combination 
of modified and unmodified coefficients. The coefficients Ci are serially ordered. The 
20 coefficients used to calculate ai for coefficient Ci are coefficients preceding i on the 
serial order. 

^ Referring to Figures 2, and 6 frames stores FS1, FS2, FS3 and FS4 are 
provided in the wavelet transformer 210, the inverse wavelet transformer 250, the 
wavelet transformer 410 and the inverse wavelet transformer 450. Frame stores 
25 FSland FS4 store unmodified coefficients. Frame stores FS2 and FS3 store modified 
coefficients Ci. 

Thus there are available both at the encoder and at the remover serially ordered 
sets of unmodified and modified coefficients. 

In the embedder of Figure 2, as coefficients Ci in store FS 1 are modified, they 
30 are stored in FS2 as coefficients Ci'. Thus modified coefficients CV are. available to 
calculate ai. Thus the set {Cn}/ used to calculate ai for modifying coefficient Ci may 



5 
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comprise modified coefficients C preceding Cz optionally together with unmodified 
coefficients C preceding Cz. 

At the remover modified coefficients Cz' are stored in store FS3. Also the 
coefficients are restored, restored coefficients Cz are stored in store FS4. Thus 
5 modified coefficients CV are available to calculate cri optionally together with restored 
coefficients Ci. 

Sets of coefficients preceding a coefficient Cz or Cz' are present in all four 
frame stores FS1, FS2, FS3 and FS4. 
Shape of sets (Cn)z 

10 A set {Cn}/ may have any convenient shape. Where ai is calculated only from 

coefficients preceding Cz, the set may consist of coefficients immediately preceding 
Cz. Where the coefficients are raster scanned to serially order them, the set may 
consist of coefficients on the same scanning line as Cz. Alternatively, it may consist of 
coefficients on that line and a preceding line. Other shapes are possible. 
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) 

Wavelets 

Wavelets are well known and are described in for example " A Really 
5 Friendly Guide to Wavelets " by C Valens, 1 999 and available at www. 

Valens shows that the discrete wavelet transform can be implemented as an 
iterated filter bank as used in sub-band coding, with scaling of the image by a factor 
of 2 at each iteration. 

Thus referring to Figure 10, a spatial domain image is applied to a set of high 
10 pass HP and low pass LP filters. At level 1, the first stage of filtering, the image is 
filtered horizontally and vertically and, in each direction, scaled down by a factor of 2. 
In level 2, the low pass image from level 1 is filtered and scaled in the same way as in 
level 1 . The filtering and scaling may be repeated in subsequent levels 3 onwards. 

The result is shown schematically in Figure 9. Figure 9 is a representation 
normal in the art. The horizontal axis indicates increasing At level one the image is 
spatially filtered into four bands: the lower horizontal and vertical band, IHi, IV i; the 
upper horizontal band hHi, IV i; the upper vertical band IHi, hVi; and the upper 
horizontal and vertical band, hHi, hVi. At level 2, the lower horizontal and vertical 
band, IHi, IV i is filtered and scaled into the lower horizontal and vertical band, IH2, 
IV2; the upper horizontal band I1H2, 1V 2 ; the upper vertical band IH2, hV 2 ; and the 
upper horizontal and vertical band, I1H2, hV 2 . At level 3 (not shown in Figure 9), the 
lower horizontal and vertical band, 1H 2 , IV2 is further filtered and scaled. 
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) UMIDs 

The Unique Material Identifier or UMID is described in SMPTE Journal March 
2000. Referring to Figure 1 1 an extended UMID is shown. It comprises a first set of 
32 bytes of basic UMID and a second set of 32 bytes of signature metadata. 
5 The first set of 32 bytes is the basic UMID. The components are: 

•A 12-byte Universal Label to identify this as a SMPTE UMID. It defines the 
type of material which the UMID identifies and also defines the methods by which the 
globally unique Material and locally unique Instance numbers are created. 

•A 1-byte length value to define the length of the remaining part of the UMID. 
10 •A 3-byte Instance number which is used to distinguish between different 

'instances' of material with the same Material number. 

•A 16-byte Material number which is used to identify each clip. Each Material 
number is the same for related instances of the same material. 

The second set of 32 bytes of the signature metadata as a set of packed 
metadata items used to create an extended UMID. The extended UMID comprises the 
basic UMID followed immediately by signature metadata which comprises: 

•An 8-byte time/date code identifying the time and date of the Content Unit 
creation. 

•A 12-byte value which defines the spatial co-ordinates at the time of Content 
Unit creation. 

•3 groups of 4-byte codes which register the country, organisation and user 

codes. 

Each component of the basic and extended UMIDs will now be defined in turn. 
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) 

The 12-byte Universal Label 

The first 12 bytes of the UMID provide identification of the UMID by the 
registered string value defined in table 1 . 



Byte No. 


Description 


Value (hex) 


1 


Object Identifier 


06h 


2 


Label size 


OCh 


-1 


Designation: ISO 


2Bh 


4 


Designation: SMPTE 


34h 


5 


Registry: Dictionaries 


Olh 


6 


Registry: Metadata Dictionaries 


Olh 


7 


Standard: Dictionary Number 


Olh 


8 


Version number 


Olh 


9 


Class: Identification and location 


Olh 


10 


Sub-class: Globally Unique Identifiers 


Olh 


11 


Type: UMID (Picture, Audio, Data, Group) 


01, 02, 03, 04h 


12 


Type: Number creation method 


XXh 



5 



Table 1: Specification of the UMID Universal Label 
The hex values in table 1 may be changed: the values given are examples. 
Also the bytes 1-12 may have designations other than those shown by way of example 
in the- table. Referring to the Table 1, in the example shown byte 4 indicates that bytes 

10 5-12 relate to a data format agreed by SMPTE. Byte 5 indicates that bytes 6 to 10 
relate to "dictionary" data. Byte 6 indicates that such data is "metadata" defined by 
bytes 7 to 10. Byte 7 indicates the part of the dictionary containing metadata defined 
by bytes 9 and 10. Byte 10 indicates the version of the dictionary. Byte 9 indicates 
the class of data and Byte 10 indicates a particular item in the class. 

15 In the present embodiment bytes 1 to 10 have fixed preassigned values. Byte 

1 1 is variable. Thus referring to Figure 12, and to Table 1 above, it will be noted that 
the bytes 1 to 10 of the label of the UMID are fixed. Therefore they may be replaced 
by a 1 byte 'Type 5 code T representing the bytes 1 to 10. The type code T is followed 
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by a length code L. That is followed by 2 bytes, one of which is byte 1 1 of Table 1 
and the other of which is byte 12 of Table 1, an instance number (3 bytes) and a 
material number (16 bytes). Optionally the material number may be followed by the 
signature metadata of the extended UMID and/or other metadata. 
5 The UMID type (byte 11) has 4 separate values to identify each of 4 different 

data types as follows: 

'Olh' = UMID for Picture material 

'02h' = UMID for Audio material 

'03h' = UMID for Data material 
10 '04h' = UMID for Group material (i.e. a combination of related essence). 

The last (12th) byte of the 12 byte label identifies the methods by which the 
material and instance numbers are created. This byte is divided into top and bottom 
nibbles where the top nibble defines the method of Material number creation and the 
bottom nibble defines the method of Instance number creation. 
15 Length 

The Length is a 1-byte number with the value 4 13h' for basic UMIDs and '33h' 
for extended UMIDs. 

Instance Number 

The Instance number is a unique 3 -byte number which is created by one of 
20 several means defined by the standard. It provides the link between a particular 
'instance' of a clip and externally associated metadata. Without this instance number, 
all material could be linked to any instance of the material and its associated metadata. 

The creation of a new clip requires the creation of a new Material number 
together with a zero Instance number. Therefore, a non-zero Instance number 
25 indicates that the associated clip is not the source material. An Instance number is 
primarily used to identify associated metadata related to any particular instance of a 
clip. 

Material Number 

The 1 6-byte Material number is a non-zero number created by one of several 
30 means identified in the standard. The number is dependent on a 6-byte registered port 
ID number, time and a random number generator. 
Signature Metadata 
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Any component from the signature metadata may be null-filled where no 
meaningful value can be entered. Any null-filled component is wholly null-filled to 
clearly indicate a downstream decoder that the component is not valid. 

The Time-Date Format 
5 The date-time format is 8 bytes where the first 4 bytes are a UTC (Universal 

Time Code) based time component. The time is defined either by an AES3 32-bit 
audio sample clock or SMPTE 12M depending on the essence type. 

The second 4 bytes define the date based on the Modified Julian Data (MJD) as 
defined in SMPTE 309M. This counts up to 999,999 days after midnight on the 17th 
1 0 November 1 858 and allows dates to the year 4597. 

The Spatial Co-ordinate Format 

The spatial co-ordinate value consists of three components defined as follows: 

•Altitude: 8 decimal numbers specifying up to 99,999,999 metres. 

•Longitude: 8 decimal numbers specifying East/West 180.00000 degrees (5 
1 5 decimal places active). 

•Latitude: 8 decimal numbers specifying North/South 90.00000 degrees (5 
decimal places active). 

The Altitude value is expressed as a value in metres from the centre of the earth 
thus allowing altitudes below the sea level. 
20 It should be noted that although spatial co-ordinates are static for most clips, 

this is not true for all cases. Material captured from a moving source such as a camera 
mounted on a vehicle may show changing spatial co-ordinate values. 

Country Code 

The Country code is an abbreviated 4-byte alpha-numeric string according to 
25 the set defined in ISO 3 166. Countries which are not registered can obtain a registered 
alpha-numeric string from the SMPTE Registration Authority. 
Organisation Code 

The Organisation code is an abbreviated 4-byte alpha-numeric string registered 
with SMPTE. Organisation codes have meaning only in relation to their registered 
30 Country code so that Organisation codes can have the same value in different 
countries. 
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User Code 

The User code is a 4-byte alpha-numeric string assigned locally by each 
organisation and is not globally registered. User codes are defined in relation to their 
registered Organisation and Country codes so that User codes may have the same 
value in different organisations and countries. 
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CLAIMS 

1 . A method of removing data embedded in material comprising the steps of: 
receiving material in which data is embedded; 

accessing an information store storing information enabling the data to be 
removed; and 

removing the said data using the enabling data accessed from the store. 

2. A method comprising the steps of: 
embedding data in material; and 

storing3-4n^an^4^^ for e na blin g t h e d ata to be 



removed from the material. 

3. Apparatus for removing data embedded in material comprising: 
an input for receiving material in which data is embedded; 
an information store for storing information enabling the data to be removed; 

and 

a remover arranged to remove the said data using the enabling data 
accessed from the store. 



4. Apparatus comprising: 

an embedder for embedding data in material; 
- a store for storing information for enabling the data to be removed from the 
material; and 

a generator for generating the enabling information when the said data is 
embedded in the material. 

5. A method comprising the steps of: 

embedding data in first material to produce second material in which the data is 
embedded; 

removing the data from the second material to produce recovered material; 
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comparing the first and recovered material to determine any differences and 
locations of differences therebetween; and 

storing corrections which correct the said differences and data identifying the 
said locations in the first material at which the differences occur. 

6. A method according to claim 5, further comprising applying a channel 
emulation , to the second material, wherein the removing step produces recovered 
material from the second material to which the emulation function has been applied. 

7. A method according to claim 5 or 6, wherein the storing step comprises 
storing, as the said corrections, the values of the first material which values occur at 
the said locations in the first material. 

8. A method according to claim 5 or 6, wherein the storing step comprises 
storing the said differences. 

9. A method according to claim 5, 6, 7 or 8, further comprising storing 
identification data which identifies the said material. 

10. A method according to claim 5, 6, 7, 8 or 9, wherein the said 
identification data comprises an identifier which identifies the second material. 

11. A method of removing data embedded in material, the data being 
embedded in the material according to the method of any one of claims 5 to 10, the 
removing method comprising the steps of: 

removing the data from the said second material to produce recovered material; 

retrieving the said corrections and locations from a store storing the said 
corrections which correct the said differences and the said data identifying the said 
locations in the first material at which the differences occur; and 

using the said retrieved corrections to correct the recovered material at the said 
locations identified by the identifying data. 
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12 A method according to claim 10 when dependent on claim 6, wherein 
the using step comprises replacing the values of the recovered material at the said 
locations identified by the identifying data by the stored values of the first material at 
the said locations. 

5 

13. A method according to claim 11 when dependent on claim 8, wherein 
the using step comprises correcting the values of the recovered material at the said 
locations using the said stored differences. 

10 14. A method according to claim 1 1, 12 or 13, when dependent on claim 9 

further comprising the step of determining the identifier of the second material and 
retrieving from the said store the corrections and locations associated with the 
identified material. 

15 15. Apparatus comprising : 

an embedder for embedding data in first material to produce second material in 
which data is embedded; 

a remover for removing the data from the second material to produce recovered 
material; 

20 a comparator for comparing the first and recovered material to determine the 

differences, and locations of differences, therebetween; and 

a store for storing data identifying the said locations and corrections which 
correct the said differences. 

25 16. Apparatus according to claim 15, further comprising a channel emulator 

between the embedder and remover. 

17. Apparatus for removing data from material in which the data has been 
embedded by the apparatus of claim 14, the removing apparatus comprising: 
30 a remover for removing the data from the said second material to produce 

recovered material; 



1-00-87, P9757GB 25 

a deliver for deriving the said corrections and identifying data from the said 
store; and 

a corrector arranged to use the stored corrections to correct the recovered 
material at the said locations identified by the identifying data. 

5 

18. A system comprising embedding apparatus according to claim 15 or 16, 
and removing apparatus according to claim 15 linked by a channel. 

19. An apparatus or method according to any one of claims 1 to 18, 
1 0 wherein the said material is video material. 



20. An apparatus or method according to any one of claims 1 to 18, 
wherein the said material is audio material. 



15 21. An apparatus or method according to any one of claims 1 to 18, 

wherein the said material is audio- visual material 



22. An apparatus or method according to anyone of claims 1 to 21, wherein 
the said data includes a UMID. 

20 

23. A computer program product arranged to carry out the method of any 
one of claims 1 to 9 and 1 9 to 22 when run on a computer. 

24. A computer program product arranged to carry out the method of any 
25 one of claims 1 1 to 14 and 19 to 22 when run on a computer. 

25. A method of embedding data in material substantially as hereinbefore 
described with reference to Figure 1 optionally together with one or more of Figures 2 
to 12. 
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ABSTRACT 

EMBEDDING AND REMOVING DATA FROM MATERIAL 

A method of embedding data in material comprises the steps of: 
embedding data in original material to produce data embedded material; 
5 removing the watermark from the data embedded material to produce 

recovered material; 

comparing the original and recovered material to determine the differences and 
locations of differences therebetween; and 

storing the said locations and corrections which correct the said differences. 
10 A method of removing the data embedded in the material, comprises the steps 

of: 

removing the data from the material to produce recovered material; 

deriving the said corrections and locations from the said store; and 

using the corrections to correct the recovered material at the said locations. 

15 



[Figure 1] 
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